Mapping Phrase Structures to Dependency Structures in the Case of (Partially) Free Word Order Languages
نویسنده
چکیده
Les corpus sont très utiles pour de nombreuses tâches dans le domaine du traitement automatique des langues naturelles. Les corpus annotés syntaxiquement sont devenus une ressource importante en TAL. Ils sont couramment utilisés, par exemple comme banc d’essai pour la génération, l’analyse et la désambiguı̈sation sémantique, et comme source pour l’acquisition de ressources (collocations, information sur la sous-catégorisation, extraction de grammaire). Lorsqu’on utilise les structures de dépendance pour le TAL, le manque de corpus annotés en structures de dépendance constitue un handicap. Nous présentons une approche fondée sur une grammaire de graphes pour convertir des corpus annotés en structures syntagmatiques en corpus annotés en dépendances. Cette approche fonctionne pour des langues à ordre de mots (partiellement) libre et fixe.
منابع مشابه
Feature Engineering in Persian Dependency Parser
Dependency parser is one of the most important fundamental tools in the natural language processing, which extracts structure of sentences and determines the relations between words based on the dependency grammar. The dependency parser is proper for free order languages, such as Persian. In this paper, data-driven dependency parser has been developed with the help of phrase-structure parser fo...
متن کاملConverting Phrase Structures to Dependency Structures in Sanskrit
Two annotations schemes for presenting the parsed structures are prevalent viz. the constituency structure and the dependency structure. While the constituency trees mark the relations due to positions, the dependency relations mark the semantic dependencies. Free word order languages like Sanskrit pose more problems for constituency parses since the elements within a phrase are dislocated. In ...
متن کاملDependency-Based Hybrid Model of Syntactic Analysis for the Languages with a Rather Free Word Order
Although phrase structure grammars have turned out to be a more popular approach for analysis and representation of the natural language syntactic structures, dependency grammars are often considered as being more appropriate for free word order languages. While building a parser for Latvian, a language with a rather free word order, we found (similarly to TIGER project for German and Talbanken...
متن کاملتأثیر ساختواژهها در تجزیه وابستگی زبان فارسی
Data-driven systems can be adapted to different languages and domains easily. Using this trend in dependency parsing was lead to introduce data-driven approaches. Existence of appreciate corpora that contain sentences and theirs associated dependency trees are the only pre-requirement in data-driven approaches. Despite obtaining high accurate results for dependency parsing task in English langu...
متن کاملTAG and Topology
Classical phrase structure tries to collapse syntactic and ordering information. However, this conception of the syntax of language is erroneous because it supposes that word order is always an immediate reflection of the syntactic hierarchy and that any deviation from this constitutes a problem, denoted by terms with negative undertones like scrambling. Modern linguistic frameworks propose a d...
متن کامل